Self-improvement: automation ledger, fact trust, hook analytics, test isolation by ScriptedAlchemy · Pull Request #178 · ScriptedAlchemy/tracedecay

ScriptedAlchemy · 2026-07-01T23:47:05Z

Summary

Findings from a TraceDecay self-audit (session transcript mining + doctor + automation run/fact-proposal logs), fixed in one pass:

Fact-proposal trust rejected wholesale: the session reflector prompt didn't say trust must be numeric, so models emitted "trust": "high" and the validator rejected every proposal. The validator now accepts low/medium/high bucket labels (mapped to 0.15/0.5/0.85) and the prompt states the numeric requirement.
Run-ledger spam: every ~30s scheduler tick appended 3 skipped / scheduler_interval_not_elapsed records (1500+ noise rows). Consecutive identical scheduler skips per task now persist once; manual-trigger skips and reason/task transitions still persist.
Corrupt hook_analytics.jsonl: concurrent hook processes raced a read-modify-rewrite append, merging/dropping lines. Appends now use a single O_APPEND write (PrivateStoreIo::append_line).
Test pollution of the real profile store: branch_db_safety_test.rs ran against the developer's ~/.tracedecay, leaving 111 corrupt branch-meta.json files and ~7k stale registry rows (now repaired locally). The suite now runs under an isolated throwaway home (IsolatedEnv + TraceDecayStorageEnvGuard).
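The bucket-label handling described in the first finding can be sketched roughly as follows. This is a minimal illustration, not the validator from the branch: `TrustInput`, `normalize_trust`, the constant names, the accepted 0.0 to 1.0 range, and the case-insensitive matching are all assumptions; only the three labels and their 0.15/0.5/0.85 values come from the summary.

```rust
/// Hypothetical parsed form of the "trust" field in a fact proposal.
#[derive(Debug, Clone, PartialEq)]
pub enum TrustInput {
    Number(f64),
    Label(String),
}

// Representative scores for the bucket labels, as listed in the PR summary.
pub const TRUST_LOW: f64 = 0.15;
pub const TRUST_MEDIUM: f64 = 0.5;
pub const TRUST_HIGH: f64 = 0.85;

/// Normalize a trust value to a number, or explain why it is rejected.
/// Assumption: numeric trust must be finite and lie in 0.0..=1.0.
pub fn normalize_trust(input: &TrustInput) -> Result<f64, String> {
    match input {
        TrustInput::Number(n) if n.is_finite() && (0.0..=1.0).contains(n) => Ok(*n),
        TrustInput::Number(n) => Err(format!("trust {n} is outside 0.0..=1.0")),
        // Assumption: labels are matched case-insensitively after trimming.
        TrustInput::Label(label) => match label.trim().to_ascii_lowercase().as_str() {
            "low" => Ok(TRUST_LOW),
            "medium" => Ok(TRUST_MEDIUM),
            "high" => Ok(TRUST_HIGH),
            other => Err(format!("unknown trust label {other:?}")),
        },
    }
}
```

Under these assumptions, a proposal carrying "trust": "high" normalizes to 0.85 instead of being rejected, while an unrecognized label still fails validation.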

Follow-up commits on this branch (in progress): daemon SIGTERM/SEGV investigation and an automatic post-update health pass for tracedecay update.

Test plan

cargo fmt --all -- --check, cargo clippy --all-targets -- -D warnings
cargo test --test automation_session_reflector_runner_test (7/7)
cargo test --test branch_db_safety_test (5/5, verified no writes to the real ~/.tracedecay during the run)
cargo test --lib automation::lifecycle (4/4, incl. 3 new skip-dedupe tests) and cargo test --lib hooks (36/36)

…nd test isolation
- accept low/medium/high bucket trust labels in session reflector fact proposals and clarify the numeric-trust prompt instruction
- stop persisting consecutive identical scheduler-skip run records that flooded the automation run ledger every tick
- append hook_analytics.jsonl lines via a single O_APPEND write so concurrent hook processes no longer corrupt or drop entries
- isolate branch_db_safety_test under a throwaway profile home so it stops writing corrupt branch-meta.json and stale registry rows into the real ~/.tracedecay store
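The append fix in the commit message above can be illustrated with a small standard-library sketch. It is not the `PrivateStoreIo::append_line` from the branch; the free function `append_line` here is a hypothetical stand-in that shows the general idea: open the file in append mode (which sets O_APPEND on Unix) and hand the whole record, newline included, to the file in one buffer rather than reading, modifying, and rewriting it.

```rust
use std::fs::OpenOptions;
use std::io::{self, Write};
use std::path::Path;

/// Append one JSONL record to `path`.
/// Hypothetical stand-in for the helper named in the PR, not its real code.
pub fn append_line(path: &Path, line: &str) -> io::Result<()> {
    // Build the record and its trailing newline in a single buffer so the
    // two are never written separately.
    let mut buf = Vec::with_capacity(line.len() + 1);
    buf.extend_from_slice(line.trim_end_matches('\n').as_bytes());
    buf.push(b'\n');

    // `append(true)` makes every write land at the current end of file,
    // so concurrent writers do not overwrite each other's offsets.
    let mut file = OpenOptions::new().create(true).append(true).open(path)?;

    // For a short line on a regular file this normally completes in one
    // write call; `write_all` only loops if the OS reports a partial write.
    file.write_all(&buf)
}
```

The key difference from a read-modify-rewrite append is that no process ever holds a stale copy of the file contents, so there is nothing to merge or drop when several hooks fire at once.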

changeset-bot · 2026-07-01T23:47:09Z

⚠️ No Changeset found

Latest commit: 0491064

Merging this PR will not cause a version bump for any packages. If these changes should not result in a new version, you're good to go. If these changes should result in a version bump, you need to add a changeset.

Graceful shutdown persists token counters and checkpoints WALs for every live project server sequentially, which can exceed systemd's stop timeout and end in a SIGKILL mid-checkpoint. Cap shutdown work at 45s, log the timeout outcome, and abort the stalled task; SQLite WAL keeps remaining state crash-safe.
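The idea of bounding shutdown work with a deadline can be sketched with the standard library alone. This is an illustrative stand-in under stated assumptions: a later commit on this page says the branch uses `tokio::time::timeout`, whereas this sketch uses a thread and a channel so it stays self-contained. `ShutdownOutcome` and `run_with_deadline` are hypothetical names, and unlike an async task, a plain thread cannot be aborted, so a timed-out worker is simply left behind.

```rust
use std::sync::mpsc::{self, RecvTimeoutError};
use std::thread;
use std::time::Duration;

/// Hypothetical outcome type: what happened to the shutdown work.
#[derive(Debug, PartialEq, Eq)]
pub enum ShutdownOutcome {
    Completed,
    TimedOut,
    Panicked,
}

/// Run `work` on a worker thread and wait at most `deadline` for it.
pub fn run_with_deadline<F>(work: F, deadline: Duration) -> ShutdownOutcome
where
    F: FnOnce() + Send + 'static,
{
    let (done_tx, done_rx) = mpsc::channel::<()>();
    thread::spawn(move || {
        work();
        // Only reached if `work` returned normally.
        let _ = done_tx.send(());
    });

    match done_rx.recv_timeout(deadline) {
        Ok(()) => ShutdownOutcome::Completed,
        Err(RecvTimeoutError::Timeout) => ShutdownOutcome::TimedOut,
        // The sender was dropped without sending, which happens when the
        // worker unwinds from a panic (assuming the default panic strategy).
        Err(RecvTimeoutError::Disconnected) => ShutdownOutcome::Panicked,
    }
}
```

A caller would pass the real deadline, for example `Duration::from_secs(45)`, and log the returned outcome. Keeping the panic case separate from completion means a crashed shutdown is not reported as a clean one.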

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: deb256e427

chatgpt-codex-connector · 2026-07-01T23:55:22Z

+    _env_lock: tokio::sync::MutexGuard<'static, ()>,
+    storage: TraceDecayStorageEnvGuard,


Keep the env lock until storage guards drop

When an IsolatedEnv is dropped, struct fields are dropped in declaration order, so _env_lock is released before storage restores HOME, TRACEDECAY_DATA_DIR, and the global DB override. If another test in this binary is waiting, it can acquire the lock and install its own isolated env while this guard's TraceDecayStorageEnvGuard then restores the old values over it, defeating the isolation this helper is meant to provide. Declare the lock after storage (or add a custom Drop) so it is dropped last.

Follow-up: the IsolatedEnv field order was fixed on master in 822d921, and the same drop-order hazard in the mcp-suite fixtures (TestProject, TestEnv, CrossProjectMemoryEnv) is fixed in #198.
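The drop-order hazard raised in the review comment can be reproduced in isolation. The sketch below does not use the real `IsolatedEnv` or `TraceDecayStorageEnvGuard`; `Noisy`, `LockFirst`, and `LockLast` are hypothetical stand-ins that only record the order in which fields are dropped. Rust drops struct fields in declaration order, which is the language rule the comment relies on.

```rust
use std::cell::RefCell;
use std::rc::Rc;

type Log = Rc<RefCell<Vec<&'static str>>>;

/// Records its name in a shared log when dropped.
pub struct Noisy {
    name: &'static str,
    log: Log,
}

impl Drop for Noisy {
    fn drop(&mut self) {
        self.log.borrow_mut().push(self.name);
    }
}

/// Field order from the reviewed diff: the lock is declared first.
pub struct LockFirst {
    _env_lock: Noisy,
    _storage: Noisy,
}

/// Order suggested by the review: the lock is declared last.
pub struct LockLast {
    _storage: Noisy,
    _env_lock: Noisy,
}

/// Drop a `LockFirst` and return the order in which its fields were dropped.
pub fn drop_order_lock_first() -> Vec<&'static str> {
    let log: Log = Rc::new(RefCell::new(Vec::new()));
    drop(LockFirst {
        _env_lock: Noisy { name: "env_lock", log: log.clone() },
        _storage: Noisy { name: "storage", log: log.clone() },
    });
    let order = log.borrow().clone();
    order
}

/// Drop a `LockLast` and return the order in which its fields were dropped.
pub fn drop_order_lock_last() -> Vec<&'static str> {
    let log: Log = Rc::new(RefCell::new(Vec::new()));
    drop(LockLast {
        _storage: Noisy { name: "storage", log: log.clone() },
        _env_lock: Noisy { name: "env_lock", log: log.clone() },
    });
    let order = log.borrow().clone();
    order
}
```

With the lock declared first, it is released before the storage guard runs its restore logic, which is the window in which another test could install its own environment. Declaring the lock last closes that window without needing a custom `Drop`.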

After refreshing the binary, plugins, and daemon, `tracedecay update` now re-execs a post-update health pass: applies idempotent global-DB schema migrations, quarantines corrupt branch-meta.json files as branch-meta.json.corrupt-<timestamp>, purges stale registry rows under the system temp dir, and summarizes remaining doctor findings. The pass is failure-tolerant (warnings, never update failure) and skippable with --no-heal.
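The quarantine step described above can be sketched as a rename with a timestamp suffix. This is a hedged illustration, not the health pass from the branch: `quarantine_if_corrupt` is a hypothetical function, the validity check is passed in as a closure because the real parse lives in `branch_meta`, and the use of Unix seconds for `<timestamp>` is an assumption.

```rust
use std::fs;
use std::io;
use std::path::{Path, PathBuf};
use std::time::{SystemTime, UNIX_EPOCH};

/// If the file at `path` fails `is_valid`, rename it to
/// `<name>.corrupt-<timestamp>` and return the new path.
/// Returns `Ok(None)` when the file is missing or valid.
pub fn quarantine_if_corrupt(
    path: &Path,
    is_valid: impl Fn(&str) -> bool,
) -> io::Result<Option<PathBuf>> {
    let bytes = match fs::read(path) {
        Ok(bytes) => bytes,
        Err(err) if err.kind() == io::ErrorKind::NotFound => return Ok(None),
        Err(err) => return Err(err),
    };

    // Bytes that are not UTF-8 are treated as corrupt without calling the check.
    let valid = std::str::from_utf8(&bytes).map_or(false, |text| is_valid(text));
    if valid {
        return Ok(None);
    }

    // Assumption: the timestamp suffix is seconds since the Unix epoch.
    let stamp = SystemTime::now()
        .duration_since(UNIX_EPOCH)
        .map(|elapsed| elapsed.as_secs())
        .unwrap_or(0);

    let mut name = path
        .file_name()
        .ok_or_else(|| io::Error::new(io::ErrorKind::InvalidInput, "path has no file name"))?
        .to_os_string();
    name.push(format!(".corrupt-{stamp}"));

    let target = path.with_file_name(name);
    fs::rename(path, &target)?;
    Ok(Some(target))
}
```

Renaming rather than deleting keeps the corrupt file available for inspection, and running the function again on the same path is a no-op because the original file no longer exists.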

branch_meta now owns the one canonical parse used by both load_branch_meta and the post-update heal quarantine, so schema-corrupt files (valid JSON, wrong shape) are quarantined instead of warning on every open. Restructures the health pass into compute/render, fetches the registry once, makes stale_code_projects borrow with a named StaleRootScope predicate, adds a shared 0o600-at-create private-open helper in PrivateStoreIo, and documents the heal-by-default policy. Adds unit + integration tests for the schema-corrupt quarantine path.

The scheduler gate now loads the run ledger once and threads the records through the run context, so gate-level and post-gate skip dedup share that one read and append_skipped_record is a pure append-unless-repeat with no second I/O pass. Also inlines tokio::time::timeout for the daemon shutdown deadline (a panic in shutdown_all no longer reads as success) and derives the session-reflector trust-label representatives from named memory::trust constants with a drift-guard test.
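The append-unless-repeat rule can be sketched over an in-memory ledger. This is a simplified illustration under assumptions: `Trigger` and `SkipRecord` are hypothetical types, the ledger here holds only skip records, and the function takes the already-loaded records as a `Vec` to mirror the single read described above. The rule it encodes comes from the PR summary: consecutive identical scheduler skips for a task persist once, while manual-trigger skips and reason changes still persist.

```rust
/// Hypothetical trigger source for a run attempt.
#[derive(Debug, Clone, PartialEq, Eq)]
pub enum Trigger {
    Scheduler,
    Manual,
}

/// Hypothetical skip record; the real ledger rows carry more fields.
#[derive(Debug, Clone, PartialEq, Eq)]
pub struct SkipRecord {
    pub task: String,
    pub reason: String,
    pub trigger: Trigger,
}

/// Append `record` unless it repeats the latest record for the same task.
/// Returns `true` when the record was appended.
pub fn append_skipped_record(ledger: &mut Vec<SkipRecord>, record: SkipRecord) -> bool {
    // Only scheduler-triggered skips are ever deduplicated.
    let is_repeat = record.trigger == Trigger::Scheduler
        && ledger
            .iter()
            .rev()
            .find(|existing| existing.task == record.task)
            .map_or(false, |last| {
                last.trigger == Trigger::Scheduler && last.reason == record.reason
            });

    if is_repeat {
        return false;
    }
    ledger.push(record);
    true
}
```

Comparing against the latest record per task, rather than the last row overall, matters when several tasks are skipped on the same tick: their records interleave, so a global "last row" check would never see a repeat.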

Moves the update/post-update wiring (plugin refresh, daemon refresh, subprocess re-exec, health pass) into src/update_cmd.rs following the *_cmd convention, bringing main.rs to 871 lines. Also promotes the branch-DB tests' IsolatedEnv into tests/common as the canonical env-isolation helper.

ScriptedAlchemy added 2 commits July 1, 2026 23:48

docs: generalize WAL wording in daemon shutdown-deadline comment

fec1a0d

chatgpt-codex-connector Bot reviewed Jul 1, 2026

ScriptedAlchemy added 9 commits July 1, 2026 23:57

Merge remote-tracking branch 'origin/master' into codex/self-improve-…

47a5caa

…20260701 # Conflicts: # src/automation/runner.rs # tests/automation_session_reflector_runner_test.rs

refactor: deslop and simplify self-improvement branch

d7c773b

Merge remote-tracking branch 'origin/master' into codex/self-improve-…

7402aea

…20260701 # Conflicts: # src/daemon.rs

fix: hold IsolatedEnv lock until storage guard restores env

822d921

ci: baseline over-length subject of published commit deb256e

9673793

ScriptedAlchemy closed this Jul 2, 2026

ScriptedAlchemy reopened this Jul 2, 2026

ScriptedAlchemy added 3 commits July 2, 2026 01:38

chore: retrigger CI after dropped workflow events

2bcd343

Merge remote-tracking branch 'origin/master' into codex/self-improve-…

c52f408

…20260701 # Conflicts: # src/doctor.rs

Merge remote-tracking branch 'origin/master' into codex/self-improve-…

0491064

…20260701 # Conflicts: # src/main.rs

ScriptedAlchemy merged commit a431724 into master Jul 2, 2026
18 checks passed

ScriptedAlchemy mentioned this pull request Jul 2, 2026

fix: hold mcp-suite test env locks until env guards drop #198

Merged

4 tasks

ScriptedAlchemy merged 15 commits into master from codex/self-improve-20260701
